From Language to Family and Back: Native Language and Language Family Identification from English Text
نویسندگان
چکیده
Revealing an anonymous author’s traits from text is a well-researched area. In this paper we aim to identify the native language and language family of a non-native English author, given his/her English writings. We extract features from the text based on prior work, and extend or modify it to construct different feature sets, and use support vector machines for classification. We show that native language identification accuracy can be improved by up to 6.43% for a 9-class task, depending on the feature set, by introducing a novel method to incorporate language family information. In addition we show that introducing grammarbased features improves accuracy of both native language and language family identification.
منابع مشابه
Native Language Interference in Writing: A case study of Thai EFL learners
AbstractThe interference of the native language in acquiring a foreign language is unavoidable. In an attempt to explore the phenomenon why this occurs, the study was conducted in English as a foreign language writing. The study also investigated how the native language interference occurred in the writing process. In fact, this qualitative study explored the reasons and the process of na...
متن کاملNative Language Interference in Writing: A case study of Thai EFL learners
AbstractThe interference of the native language in acquiring a foreign language is unavoidable. In an attempt to explore the phenomenon why this occurs, the study was conducted in English as a foreign language writing. The study also investigated how the native language interference occurred in the writing process. In fact, this qualitative study explored the reasons and the process of na...
متن کاملUse of Articles in Learning English as a Foreign Language: A Study of Iranian English Undergraduates
The significance of error analysis for the learner, the teacher and the researcher is now widely recognized. Earlier studies of error analysis concentrated on intersystematic comparison of the “native language” and the “target language” and drew the required data largely from intuitions and impressionistic observations. This study was conducted on the basis of the following observations: (1) to...
متن کاملFinite element model updating of bolted lap joints implementing identification of joint affected region parameters
<span style="color: black; font-family: 'Times New Roman','serif'; font-size: 10pt; mso-fareast-font-family: 'Times New Roman'; mso-themecolor: text1; mso-ansi-lang...
متن کاملAn Investigation of Assessment Literacy Among Native and Non-Native English Teachers
The current study aimed at examining the relationship between English language teachers’ assessment literacy and their teaching experience. In other words, it intended to inspect the relationship between native and non-native English language teachers’ assessment literacy and their teaching experience. To achieve such goals, 100 native and non-native English teachers from ESL and EFL contexts w...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره شماره
صفحات -
تاریخ انتشار 2013